Gene Franean1_3002 details


Gene Information

Locus tag: Franean1_3002
Symbol:
ID: 5671385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3531611
End bp: 3532813
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 63%
IMG OID: 641241905
Product: hypothetical protein
Protein accession: YP_001507325
Protein GI: 158314817
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
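
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, which is consistent with the reported 1203 bp, and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 3531611
END_BP = 3532813


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1203 400"
```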


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGCTG CTCTTCTGCT CCTCACAGGA GCCTGCTCTG CCTCGGACGT TTCACCGGCA 
CCACGTGACT GCGTCGTGTC CCCGGGAGTT ACCAAGGACA CGGTCAGGCT CGGCCTGATA
CTGACGGACA CCGGGACTAC AGCCAAACTG TTCATCGGTG CCCGCGCTGG TATCGATGCG
CGAATCCGGT CTCAGAACGA GAGGGGCGGG GTGCGCGGCC GCACACTCGT CTATGACTGG
CGGGACGACG AGTCGAATCC AGCTCAGAAC CTGGCCGTTG CGAGAGAGCT GGTCGAGAAC
GGCAAGGTCT TCGGCCTGCT CAGCGCCACA AGCGTGGCTA CTGGCTCGGC CCAGTACCTT
CACGGCGCCA GAGTACCGGT CGCTGGCCTT GCGATGGAAT CGGTATGGTC GACCTTCGAC
AATATGGTCA GCTACATGAA CCAGATGCCG TCGGCGGTGG CGTTCGACAC CCTCGGGCGA
TTCAGCAGCG CTATGGGTGT CCAGCGCGCT GTCATCGTCA TGACCGGCTC CTCCGAGACG
TCCCGGGCCG GGGCGACCTG GATCGCGAAA ATCCTGCAAT ACTCCGGAAT AGGGATCGCG
GCAACCCTAG ACTACTCGCC GGCAGCGATC ACGCCGGCAC TGCTAGGACG GCAGATAGCA
CAGCTCAGAG CCGACGGGCT GTTCGTCTCG ATGCCCGGCG ATGACTTCTC CGACATTTAC
TACGGCGCGA TGACTGCTGG CGCCGCCTTC AAGGTTGGAC TCGGAGTCCA CGGCTACGGC
CATGAGCTCC TGGCTCGGAA TGGAACGAAG ATCGGCGGTG CATACTTCTA TGTCCCATAT
CTCCCCTTTG AGGCGAACGC ACCCGCACAG CGCGCATATC TCGATGCGGT CACGCGGTAC
GCGCCGGAAC TCAACCCCCC CGAGGCTCAG GCGGCCGTCG AGTCCTATAT CACCACCGAC
CTGCTGATCA GAGGGCTCGA GGCTGTGGGG CCGTGCCCTA CACGCGACGG TCTGCTCGGG
GCACTGAGGT CGATCTCGGA CTTCGATGGC TCGGGTCTGC TGCCCGTACC CATCGATCTC
ACGCAGGGTT TCGGTCAGCC AGGCCGCTGC CTCACCTTCG TACGCGTCAA TCAGGCGGGC
GCGGGATTCG ATGTCATGAA GCCACCGGTG TGCGGCGAGC TCATTCCCAG TCCGACCCCG
TGA
 
Protein sequence
MSAALLLLTG ACSASDVSPA PRDCVVSPGV TKDTVRLGLI LTDTGTTAKL FIGARAGIDA 
RIRSQNERGG VRGRTLVYDW RDDESNPAQN LAVARELVEN GKVFGLLSAT SVATGSAQYL
HGARVPVAGL AMESVWSTFD NMVSYMNQMP SAVAFDTLGR FSSAMGVQRA VIVMTGSSET
SRAGATWIAK ILQYSGIGIA ATLDYSPAAI TPALLGRQIA QLRADGLFVS MPGDDFSDIY
YGAMTAGAAF KVGLGVHGYG HELLARNGTK IGGAYFYVPY LPFEANAPAQ RAYLDAVTRY
APELNPPEAQ AAVESYITTD LLIRGLEAVG PCPTRDGLLG ALRSISDFDG SGLLPVPIDL
TQGFGQPGRC LTFVRVNQAG AGFDVMKPPV CGELIPSPTP
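
The protein sequence can be checked against the gene sequence by translating it with translation table 11, and the GC content can be recomputed from the gene sequence in the same way. The following is a minimal sketch in plain Python, not the database's own pipeline. It assumes the codon assignments of table 11 match the standard genetic code and that an alternative start codon such as GTG is read as methionine at the first position. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons accepted at the first position (read as M there).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # the initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("GTGTCGGCTG CTCTTCTGCT CCTCACAGGA"))  # prints "MSAALLLLTG"
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the 400 aa protein sequence and the GC content reported in the record.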