Gene Franean1_6333 details


Gene Information

Locus tag: Franean1_6333
Symbol:
ID: 5675785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7692045
End bp: 7693166
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 70%
IMG OID: 641245185
Product: hypothetical protein
Protein accession: YP_001510580
Protein GI: 158318072
COG category:
COG ID:
TIGRFAM ID:
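
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here for illustration; they are not part of any database tooling.

```python
# Values copied from the record above.
start_bp = 7692045
end_bp = 7693166
gene_length_bp = 1122
protein_length_aa = 373


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # prints: True True
```

Under those assumptions, the 1122 bp span corresponds to 374 codons, which is 373 residues plus one stop codon.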


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAGCGG GCAGGATCAG CGCCGCGGCC TCGGTTCCCG GCACGACGGG GTCCGCCGGG 
CTGCGCGGAC GATCGCAGTC CTCTCCGACG ACCACACCCG CTCGGTCTAC GGCCAGGCCG
AATGGCAGGC CGCCTGGACT GCGGATCCCT CCGGGGCGGC ACACGCTCCT CGTGATGCGG
GTAGCTGACT GCGACCGGCC CGGCCTGCTC GCCCAACTGG TATCCACCGA CCTGTTCGCA
CACCCGGAAC GGCAGGCACG GGTGGAACTG CTCCGCGCGG CGACGCTCGC TCGCTCCGGT
GGCCGGGCGA AACCGCAGCA CGCTCCACCG TTCCCACCTG ATCGGCGGGC GATGCCCGTG
CGGCCGTCGT TCCCTGGCGG CCGGCCGGAC GTGTGGAACG TCCCGCCACG GCCGGCTCAC
TTCGTCGGCC GTACCGATCT TCTCGATCAG GTTCACCAGC AGCTCGGCGA CGCGGGCGCG
GTCGCGGTCT GCGCCCTGCA CGGCCTCGGC GGGATCGGGA AAACCGCCCT CGCGATCGAG
TACGCCCACC GCCACCCGCA CAGCTACGAC CTGATCTGGT GGATAGCTGC CGAAGACCCA
CAGCTGATCC CGGGGCATGT GTCCACCCTC GGCCGGGAAC TCGGCCTGCC CGACGGCGCG
GACTGGCCCG CCGTCCTCAA CGTACTGCGA CGCGAACGGC TGCGTTGGCT GCTCATCCTC
GACAACATCG AAGACCGGAA CGTCATCAGC CCGTTCCGGC CGACGGATCA CCTCGGCCGG
CTACTCGTCA CCTCACGACG CACCGGCCTC GACGCTTTCG GTCCACAGCT CGCCGTACCC
GAACTTCCCC GACGAGACGC GGTCGACCTG CTCACCCGGC GGGTACCCGC CATCGACACC
GGGACGGCCG GGCAGATCGC GGAACTTCTC GGAGACCTGC CCCTAGCCGT GGAACAAGCC
GCCGGCTACC TCACCCAAAC CGGCATGCCC TCCGACGACT ACGTCGAGCT ACTCCGCGGG
CGGCTCGGGG AGATGCTGCA CCGCGGGTGG GTCGCCGACC GGCCCGACAT CACCACCGCC
AACCTGTGGA ACCTGTCCCT GACCCAATGC CGTGAAGCTT AA
 
Protein sequence
MRAGRISAAA SVPGTTGSAG LRGRSQSSPT TTPARSTARP NGRPPGLRIP PGRHTLLVMR 
VADCDRPGLL AQLVSTDLFA HPERQARVEL LRAATLARSG GRAKPQHAPP FPPDRRAMPV
RPSFPGGRPD VWNVPPRPAH FVGRTDLLDQ VHQQLGDAGA VAVCALHGLG GIGKTALAIE
YAHRHPHSYD LIWWIAAEDP QLIPGHVSTL GRELGLPDGA DWPAVLNVLR RERLRWLLIL
DNIEDRNVIS PFRPTDHLGR LLVTSRRTGL DAFGPQLAVP ELPRRDAVDL LTRRVPAIDT
GTAGQIAELL GDLPLAVEQA AGYLTQTGMP SDDYVELLRG RLGEMLHRGW VADRPDITTA
NLWNLSLTQC REA
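
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only and rests on these assumptions: translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, the CDS is complete, and the initiator codon (GTG here) is read as methionine. The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS codon by codon, ignoring any trailing partial codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues and residues[-1] == "*":
        residues.pop()  # drop the terminal stop codon
    if residues:
        residues[0] = "M"  # assumption: the initiator codon is read as Met
    return "".join(residues)


# First display line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGAGAGCGG GCAGGATCAG CGCCGCGGCC TCGGTTCCCG GCACGACGGG GTCCGCCGGG"
print(translate_cds(first_line))  # prints: MRAGRISAAASVPGTTGSAG
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` allows a comparison against the listed protein sequence and GC content.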