Gene Plav_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1231 
Symbol 
ID5454612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1366218 
End bp1367309 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content62% 
IMG OID640876801 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001412508 
Protein GI154251684 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.388459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGC GTTTCAAAGC TGCCGCCAAG ACGAACGGCG AGGGTTCTGC CGCGTCCGCC 
CGCAAGGGTG GTGCCGATGA TCATGCGATC GTCGGACGCC GTATCAGCCA GGAAGCGCTG
AAGCGTCTGA GCGAGCTCCA CACGCGCTAT CTCAAGGGCA TCCCGAACGG CTGGTGCGCG
GTGATGAAGG AGTGCGATCT GTCGGGTCTC GATTTTCGCA ACCTCAACTT CTCGCATGGT
CATTTCATCG GCTGCGACTT CACAGGCTGC GACCTTGAAG ACGCGCATTT TTCGGGCGCC
AATCTTTTCA GCGCAAATTT CGACCATGCG AATCTCACAC GCACCAATTT CTCGCGCGCG
GATTTGCGGG GCGCGAATTT CGAAGATGCC GAAATGGCGG ATGCACAGCT CGATGGTGCC
GACCTGCGGC GTGGCGCGGT GATAAGGCGC GGCGCCTCGG CACCTGTTGG CCGCGAGAAT
TCGAGCTTTC GCGGTGCACG GATGTACGGC ACCAACATGG CCGAATGCAA ACTTCTCGAC
GCCGATTTCG AAGGGGCCTC TATCTCCGGC GCTAGCCTGC AAGGTGCCGA TCTGCGGGGT
GCGAACTTTG CGGGTGCCGA GCTCAAGGGC GTCGAATTGT CGGGGGCTAA TCTCGCCGAT
GCGGATTTCC GCCGCGCCGT CATGGACGAG GCGACAATCG CGCGCGGCGA CATGATGCGG
GCGACCAGGC CGAGGCCGGC GCCCAATCCC GAACGCATGG AAAAAATACT GGCGCTTCAT
CTCGAGTGGA TCCAGACCGG CCAGCAAAAA GGCCAGCGCG CCGATTTCAC CCGGATGGAT
CTCTCGCGAA AGGATTTCTC CAGGGCCGTG CTTGCCGGGG CCCATTTCCG TGAGGCCATC
CTCGCCGATG CAAATTTCGA AAAGGCGATC CTTGCCGCCG CCGATTTCAG CAATGCGATC
CTGTTTCGCG CCAACCTCGC CGGGGCCGAT CTCCGGGGCG CCGATCTCAG GGGTGCCGAT
CTGAAGAATG CCCGGCAGGA TGACACCAAG AAGGGCGAGC TGGACGGCAC CAGCCTGGCC
ACCAGGCTCT GA
 
Protein sequence
MSERFKAAAK TNGEGSAASA RKGGADDHAI VGRRISQEAL KRLSELHTRY LKGIPNGWCA 
VMKECDLSGL DFRNLNFSHG HFIGCDFTGC DLEDAHFSGA NLFSANFDHA NLTRTNFSRA
DLRGANFEDA EMADAQLDGA DLRRGAVIRR GASAPVGREN SSFRGARMYG TNMAECKLLD
ADFEGASISG ASLQGADLRG ANFAGAELKG VELSGANLAD ADFRRAVMDE ATIARGDMMR
ATRPRPAPNP ERMEKILALH LEWIQTGQQK GQRADFTRMD LSRKDFSRAV LAGAHFREAI
LADANFEKAI LAAADFSNAI LFRANLAGAD LRGADLRGAD LKNARQDDTK KGELDGTSLA
TRL