Gene Franean1_3866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3866 
Symbol 
ID5672229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4595342 
End bp4596736 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content68% 
IMG OID641242744 
Producthypothetical protein 
Protein accessionYP_001508164 
Protein GI158315656 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.679852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACCCC GTAGAACTGT CGCGGCCGTC GCTGCCCTCG CAGCGGTGAC CGTCTTCGTC 
GCAGGATGTG GCCGCGACTC GGGCGGCGAA TTAGGGAGTG AGGACACTCC CGCCACCCAG
GGGTCCACGG CCGCCGCAGC GGGTGACTTC GGCGACCTGA AGGACGTCTG TGGCACCGGC
GACCCGAAGG GCGCGCCCGC TCAGGGCGTG ACCGCGAGCG AGATCAGCGT CGGTGTCTTC
AGTGACGTCG GCTTCACCAA GAACTCTGAG TTCGACGACG CCGCCAAGGT GTTCACCTCC
TGGTGTAACG AGGCCGGCGG CATCAACGGT CGCAAGATCG CCTACAACCT GCGGGACTCG
AAGATCTTCG AGACCCGCCA GCGGATGATC GAGTCCTGCC GCGAGGACTT CGCGATCGTC
GGCGGTGGCA GCGCGCTCGA CGCCGGCGGC GTGGAAGAGC GCCTCAAGTG CCTGCTCCCC
GACATCCCCG CGCAGACGAG CCAGCCGGAG AACATCGGCT CGGACCTGCA GATCGACGCG
ATCGGCGCCG GGCACTCCTA CATCCGCTAC GCCGGTTACT TCAACTGGCT GCTGAAGGAG
GCCTACCCGG CCTCCGCCGG TGCGGTCGGC ATCATCGCCG GTGACTCCCC GGTGACCAAG
GTCATCGGGG ACCAGACGGT GGAGGCCGTG CAGAAGGCCG GCGGGACGGT CGCCTACAAC
GACCTCTACC CGGCGGCCGG CGTCTCGGAC TGGACGCCCT ACGCCCAGGC GCTCAAGAGC
AAGGGCGTGA AGGGCCTGGT CTTCCAGGGC GACTTCCGCA GCCTCGCGAA GCTCGAGCAG
GTGCTGTCGT CGATCGACTA CAAGCTCGAC TGGATCGACG CCAACAGCAA CGCCTACGGA
TCCGCGTTCG TGGAGCTCGC CGGCGACGCC ATCAGCACCC AGAACAACCT GGCCGACCTC
AGCGGGGTCG CGCCTCTCGA GGTGGCGGAC GAGATCCCCG CCGTCCAGAA GGTCCTGGAC
CTCTACAAGG AGTACGCGCC CGACGCGGAG GTCACCTTCC CGGCACTGCG CGCCTTCTCG
TCCTGGCTGC TGTTCGCGGA GTCGGCCAAG GAGTGCGGGG ACGACCTCAC CCGCAAGTGC
CTCTACGACA CGGCGCGCGA GCAGACCAAG TGGACTGCCG GTGGCCTGCA GGCCTCGGTC
GACATCACCA AGGCCGACGC GCCGCTGAAG TGCTTCAACG TCGTGCAGGC GAGCGCGGAC
GGCTGGAAGC CCGCGGACTT CGAGCCTGAC ACCGGTGTGT TCCGCTGCGA TGCCCCTTCC
GTCAAGTACA CGGGCTCGTA CGGCACGCCG CTCACCCTCG CCAGCGTCGG CAAGAGCCTG
AGCGACCTCA AGTAA
 
Protein sequence
MRPRRTVAAV AALAAVTVFV AGCGRDSGGE LGSEDTPATQ GSTAAAAGDF GDLKDVCGTG 
DPKGAPAQGV TASEISVGVF SDVGFTKNSE FDDAAKVFTS WCNEAGGING RKIAYNLRDS
KIFETRQRMI ESCREDFAIV GGGSALDAGG VEERLKCLLP DIPAQTSQPE NIGSDLQIDA
IGAGHSYIRY AGYFNWLLKE AYPASAGAVG IIAGDSPVTK VIGDQTVEAV QKAGGTVAYN
DLYPAAGVSD WTPYAQALKS KGVKGLVFQG DFRSLAKLEQ VLSSIDYKLD WIDANSNAYG
SAFVELAGDA ISTQNNLADL SGVAPLEVAD EIPAVQKVLD LYKEYAPDAE VTFPALRAFS
SWLLFAESAK ECGDDLTRKC LYDTAREQTK WTAGGLQASV DITKADAPLK CFNVVQASAD
GWKPADFEPD TGVFRCDAPS VKYTGSYGTP LTLASVGKSL SDLK