Gene Franean1_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1336 
Symbol 
ID5669747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1608465 
End bp1609562 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content70% 
IMG OID641240267 
Producthypothetical protein 
Protein accessionYP_001505694 
Protein GI158313186 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA CCACGTCCAC GGCCGCGACA CCCTCGACGT TCGACATCGT CGCCGCGCGC 
TTCCCCCTCG TTCCCCGCTC CCGGCCATCC TGTCCACCCC TTGATGCCCG AATCGCCCAC
GTCGCCGCCC TCGCCGGCCA AGCCGCCGGG GGCGGCGGCG ACGCGCTGCT GCGCGCGGCC
GAGGCGCACA ACCTCGCAGC GCTGATCGCC AGCGACTGCG GACTGCCTGA CCTCGCACGA
AGCCTCTGCT GGCGCCAGAT CGACACCCTC CCGCTCCGCC GCCCACTCGA CGGAGCGACG
GCGAAACTCG CCCTCCAACC GTTCATCAAC CTGGCGCGCC TGCGGCTGCG CGCGGGCGAT
GGCCTGGCCG CGTACCAGAT GCTCACGACG CTCTACGACG TCGTCGTGGC GAGGACCAGC
ACCGCCATCG ACGAACGAGC ACTCGTGTTC GACGACCTCG TCACCGATGT CGACCACCCA
CAGACCGTCC GCTGGCTGTG GACCGTCCTG CTCGCCGACG GCACCCGCGC CCTGACCCGA
ACCGGCCACT GGACCGAGGC CCTCGACCAC CTCAACCGCC ACAAGGGCAT CGGACAGCGC
CTCCTCGACG GCCGCCAGAC CGCGATCCTT GCCCACCACG CCCACCGAGA CCATTACGCC
GCCGAGCACC TGCTCACCAC CACAGCCACC ACCCAGCCCT GGGAGCAATC CGTCGCCACC
TGCCTCGGCC TCCTCCACAG ACACCTCACA GGTCTCAAGA CCCCTGACGA CGGCAGGAGC
ACGATCGATG CGCTCCTCCC GTCGAACAAC CCCGAGCACC TGACGTTCAA CATCCAGCTC
GGCCTGTGCC TCCTCGACCT CGCGGACACC CCCCAGCATC TGAGGCCGGT CCTCGACACG
ATCATCGACG GCGCCCTGCA CAGCGACGAT GCCTACGCCG CCCGAGACCT ACTCACCCAC
CCAGCCGCCC GCGGGTACCT CAACCGCGAC CAGCTGACGC TTCTGAACGA AAGACAGCGA
CACTCCGGGC TCGGTAGCGG CCGCATCCCC GAAGCGCTCC GCACACGGCT CCTCGGCGCC
CTCGCGCTCG CCAGCTGA
 
Protein sequence
MAATTSTAAT PSTFDIVAAR FPLVPRSRPS CPPLDARIAH VAALAGQAAG GGGDALLRAA 
EAHNLAALIA SDCGLPDLAR SLCWRQIDTL PLRRPLDGAT AKLALQPFIN LARLRLRAGD
GLAAYQMLTT LYDVVVARTS TAIDERALVF DDLVTDVDHP QTVRWLWTVL LADGTRALTR
TGHWTEALDH LNRHKGIGQR LLDGRQTAIL AHHAHRDHYA AEHLLTTTAT TQPWEQSVAT
CLGLLHRHLT GLKTPDDGRS TIDALLPSNN PEHLTFNIQL GLCLLDLADT PQHLRPVLDT
IIDGALHSDD AYAARDLLTH PAARGYLNRD QLTLLNERQR HSGLGSGRIP EALRTRLLGA
LALAS