Gene Franean1_3822 details

Gene Information

Locus tag: Franean1_3822
Symbol:
ID: 5672186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4540063
End bp: 4541799
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 72%
IMG OID: 641242701
Product: Rhs element Vgr protein
Protein accession: YP_001508121
Protein GI: 158315613
COG category: [S] Function unknown
COG ID: [COG3501] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01646] Rhs element Vgr protein
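
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no amino acid. Both assumptions are consistent with the values listed in this record, but the record itself does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4540063
end_bp = 4541799

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1737 bp) and Protein Length (578 aa).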


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.112104
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0443202
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGACCC CCGAGACTTA CGGCGCGCTG CCCGTCCTCT ACCTGGACGG GAAGCCGGTG 
CCGCCCGCGA TCAAGGAGAC TATCCTGCGG GTCGTCGTGG ACAGCGACGT CGCGGCGCCG
GACGCCTGCC GCGTGGTACT CAATGACCCC GGCCGGGACG TGCTCGCGGC GGCCGGGTTC
GACTTCCGCC ACGCGCTGAA GGTCACCGCG CCGCCGAGGG CCACCCCCGA GGGCGGGGGC
GCGGAGAAGG TCCTCTTCGA GGGCACCATC TACAGCCTCG GCTTCGGCTA CGACGAGCGG
GGCGCGACCG CTGTCGTGGT GGCCTACGAC AGCTCATACG CTCTGTTCAA CGGCGTGCAC
ACGGCCACAT ACCACAACGT CACCGACTCC GACCTGGTGA CGAAGATCGC CCGCGAGCTG
AGCATAGACA CCGGCACGAT CAATCCGACG ACGGTCGTCC ACGAACACGT CGGCCAGGTC
AACGAGACGC ACTGGGACTT CCTCACCCGA AGGGCCAGAG AGGTCGACCA CGTGCTCCGG
GTACGGGACA ACAAGATGGA GTTCGTCCGT CCCACCGCCG CCGACGACGC GCCCAGGCCG
GGCAACTTCG ACAGCCCGAG CCACCTGCAG CTCACCCCAG GTGGCGACCT GGACGTCTTC
ACCGCCCGGG TGACCGCCGC GCAGCAGGTG TCCGAGGTCG AGGTCCGCGG CTGGGACGAC
CGGGGCAAGC GCGAGCTGGT GGCCACCGCA CGGGCGAGCA CTCGCGCCGC GCAGATCAAG
GACGACCCGG CCGACCTGGG GGCGGGCAAC TCCTCCGCCC GGTACGTCGC TCCCGCCCGG
CCGCTCGCGA CGCAGGCCGA GTGCGACGCG ATGGTCGCCG CGGTCGCCGA GCGGATCGCC
AGCACCTCGG TCGCCGCCGA GGGCGTGGCC CACGGAGATC CGCGCATCCT CGCCGGTGTG
GCGCTGAGCG TGGGCCGGAC CGGGGGGAGC TTCGACGGCA AGCTCACCGT CTCCCACGCC
GAGCACGTCT TCGACCACGC CAGCTACCGC ACCCGGTTCA CGGTGAGCGG GCCGCACGAC
CGGTCCCTGC TCGGTCTGGC CTCGGCCGCC GGCGCCCGGC AGAGCAGCCC GCTGATCGCC
GGTGTGGTGC CGGCCGTCGT CTCCAACATC AACGATCCGG AGTCCCGCTG CCGGGTGCGG
GTGAAACTGC CCTGGCTGTC CGCGGACTAC GAGACCGACT GGGCCCGGGT CGCGATCGCC
GGCGGCGGCC CGGACCGCGG GATGCTGGTG CTGCCCGAGG TCAACGACGA GGTGCTGGTC
GCTTTCGAGC AGGGTGACCC GCGCCGCCCG TTCGTGCTCG CCGGCCTGTA CAACGGTGTG
GACGCGCCGC CCTTCGGCGG CGGCGTCGAC ACCGCCGCCG GCACCGTCGT CCGGCGCGGC
CTGCGCACCC GGAAGGGCCA CGAGATCGTG GTCAGCGACG CCGACGGCGA CGAGCACGTG
GAGATCCGCA CCCGGGACGG CAAGGTGCGG ATCCGGCTCG ACCACGACCA GGGCGGGCTC
ACCATCGAGA CCGACGCGGA CATCGACGTC CGGGCCAAGG GGAAACTGTC GCTCACCGCC
GAGCAGGACC TGACGATCTC CGCGCGGGGT ACCGGGTCGA TCTCTGCTGA CGCCGGCCTC
ACCCTGTCCA GCCGCGCGGA CGTCACAGTG CAGGGAAACC CGATCAAGCT CAACTGA
 
Protein sequence
MATPETYGAL PVLYLDGKPV PPAIKETILR VVVDSDVAAP DACRVVLNDP GRDVLAAAGF 
DFRHALKVTA PPRATPEGGG AEKVLFEGTI YSLGFGYDER GATAVVVAYD SSYALFNGVH
TATYHNVTDS DLVTKIAREL SIDTGTINPT TVVHEHVGQV NETHWDFLTR RAREVDHVLR
VRDNKMEFVR PTAADDAPRP GNFDSPSHLQ LTPGGDLDVF TARVTAAQQV SEVEVRGWDD
RGKRELVATA RASTRAAQIK DDPADLGAGN SSARYVAPAR PLATQAECDA MVAAVAERIA
STSVAAEGVA HGDPRILAGV ALSVGRTGGS FDGKLTVSHA EHVFDHASYR TRFTVSGPHD
RSLLGLASAA GARQSSPLIA GVVPAVVSNI NDPESRCRVR VKLPWLSADY ETDWARVAIA
GGGPDRGMLV LPEVNDEVLV AFEQGDPRRP FVLAGLYNGV DAPPFGGGVD TAAGTVVRRG
LRTRKGHEIV VSDADGDEHV EIRTRDGKVR IRLDHDQGGL TIETDADIDV RAKGKLSLTA
EQDLTISARG TGSISADAGL TLSSRADVTV QGNPIKLN
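
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a GC-fraction helper of the kind that could be used to compare against the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first line of the gene sequence is used here as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "GTGGCGACCC CCGAGACTTA CGGCGCGCTG CCCGTCCTCT ACCTGGACGG GAAGCCGGTG "
seq = clean_sequence(first_line)

# Report the cleaned length, the first codon, and the GC fraction of this fragment.
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

To check the whole gene, the same two functions can be applied to the full gene sequence block. The first codon is GTG, which translation table 11 permits as a start codon; it is read as methionine at initiation, in line with the leading M of the protein sequence.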