Gene Franean1_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3803 
Symbol 
ID5672167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4512279 
End bp4513421 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content73% 
IMG OID641242682 
Producthypothetical protein 
Protein accessionYP_001508102 
Protein GI158315594 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.544146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.13322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCACGA AGTCCGGGAT CGCCGTGGCG GTCGCGACGG TCGTGCTCGT CGTGGCCGGC 
TCCGTGCTGG ACTATCCGGA GCTGCTCGCG CTCGGCTTGG CGGCCGGTGT CGCCCTGCTG
TTCGCGGCCG GCTGGATGCT GGTCACCCCG GATGTCACCC TCTCCCGGGA GATCCACCCG
CCGCGGGTCT TCGAGGGGGA CGGCGCCCGC GCCCTGATCG CGGTGACGAA TGCGGCCCGG
CGGCGCAGCC CGCCCATCCT CGCCGCCGAG TCGGTGGGCG ATCGCACGGT CGCGGTGGCC
CTGCCGAGCC TGGCACCGGG CAGCCGTTTC TCGGCAACCT ACCCGCTGCC GACCGACCGG
CGCGGCGTCT TCGAGGTCGG GCCGCTGGTC GTGGGCCACA GCGACCCGCT GCGGCTGTTG
CACGTGGGGC GGGCTTTCCC GTCCCGGTCG ATGCTGCGGG TGCACCCGCG GATCCATCCG
GTGGGCCCCC TGCCGACCGG TGGTTCGCCC GATATGGACG GCCCGACCAG CGCGACCGCG
CCGCAGGGCG GGGTGGCGTT CCACAGCCTG CGCGAGTACG TGCGTGGCGA CGACCTGCGG
CTGATCCACT GGCGGTCGAC CGCGCGCAGC GGACGGATGA TGGTGCGCCA CAACGTGGTG
CCGAACGAAC CCCGGATGAT GGTCGTGTTG GACACCAGTG AGTCGCCGTA CCAGGGCGAC
TACTTCGAGG ACGCGGTCCG GGTCGCCGCA TCGCTGGCGG TGTCCGGCTG CCAGCGCGGC
TTCCCGGTCG AGTTGCACAC CACCGGTGGG ACGCGGGTGG TCGCCGAGAG CGGACAGGAC
ACCACCAGCG TCCTCGATGC CCTCGCCGGC GTCCGTCCCG GACCGGACGA CCCCGGGCTC
ACGGCGCTGC TGCGCATGGT TCCGCGCGAG GAGGGCGCCG CGCTCGGCGT GGTGACGGGG
CAGCCGCCGG GGGCGAAGAT CTCCGTCATC TCGGCGGTTC GAGCCCGGTT CGCGATGGCG
AGCCTGGTCT GCGTCGGGGA GGAGCACGGT CGTCCCGGGC CTCCCGTCCG CGGGGCGCTG
GTGGTGAACG TCCGCACCAG CACGGACTTC GCGTCCGTGT GGAACGCGTC GGTGCGCCGA
TGA
 
Protein sequence
MITKSGIAVA VATVVLVVAG SVLDYPELLA LGLAAGVALL FAAGWMLVTP DVTLSREIHP 
PRVFEGDGAR ALIAVTNAAR RRSPPILAAE SVGDRTVAVA LPSLAPGSRF SATYPLPTDR
RGVFEVGPLV VGHSDPLRLL HVGRAFPSRS MLRVHPRIHP VGPLPTGGSP DMDGPTSATA
PQGGVAFHSL REYVRGDDLR LIHWRSTARS GRMMVRHNVV PNEPRMMVVL DTSESPYQGD
YFEDAVRVAA SLAVSGCQRG FPVELHTTGG TRVVAESGQD TTSVLDALAG VRPGPDDPGL
TALLRMVPRE EGAALGVVTG QPPGAKISVI SAVRARFAMA SLVCVGEEHG RPGPPVRGAL
VVNVRTSTDF ASVWNASVRR