Gene Franean1_4376 details

Gene Information

Locus tag: Franean1_4376
Symbol:
ID: 5672729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5221516
End bp: 5222703
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 68%
IMG OID: 641243245
Product: integrase catalytic region
Protein accession: YP_001508662
Protein GI: 158316154
COG category:
COG ID:
TIGRFAM ID:
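
The length fields above follow from the coordinates. The sketch below is a minimal check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The function names are hypothetical and not part of the source database.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 5221516
END_BP = 5222703


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1188 395
```

Both values agree with the Gene Length and Protein Length fields of the record.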


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGGCA CGAATGGGGT GTGGTCGCTG CTCTACGCCC TGACACGCAA CGCTCTCGGA 
CTGATGCTGC TCCGGGTCCG TGGGGACACC GCGAAGGACG TGGAGCTCCT TGTCCTGCGA
CATCAGGTGG CGGTACTGCG ACGGCAGGTG AACCGTCCGA CGTTGGAACC GGCGGATCGG
CTGATCCTCG CGGCGCTGTC CCGGCTGCTG CCCCGGGCCC GCTGGGGTTC GTTCTTCGTC
ACCCCCGCCA CCGTGCCGCG CTGGCACCGG GAACTCCTCG CACGCCAATG GACCTACCCG
CGGAAGTCGC CTGGGCGGCC ACCGGTCCGC CGGGAGATCC GCGAGCTGAT CCTGCGCCTC
GCACGGGAGA ACCCGACCTG GGGCCACCGC CGGATCCACG GCGAGCTCGT CGGGCTGGGT
TACACGGTCG GGGTCGCCAC TGTCTGGCGG ATCCTGCACC GCGCCGGTGT CGACCCCGCA
CCCCGCCGGG CCGACACCTC CTGGCGCACG TTCCTGTCCG CCCAGGCCTC CGGCCTGCTG
GCCTGCGACT TCTTCACCGT GGACACCGTG TTCCTCCAAC GGATCCACGT GCTCTTCGTC
GTCGAACACA CCACCCGCCA CGTCCACGTC CTCGGGGCCA CGAAACACCC GACCACGGCG
TGGGTCACCC AGCAGGCACG GAACCTGCTG ATGGACCTCG ACGAGCGTGG CCACCGGTTC
CGGCTCCTCA TCCGTGACCG CGACACGAAA TTCACGGCCT CGTTCGACGC TGCCTTCGCC
GGGGCCGGCA TCGACGTGAT GCGCACACCG CCACAGTCAC CGAAAGCGAA CACGATCGCG
GAACGCTGGG TCGGCACCGT CCGCCGCGAA TGCACCGACC GACTACTGAT CGTCTCCGAA
CAGCACCTCA CGTCGGTCCT CAGCAGCTAC GCCAAGCATT TCAACACCCA CCGACCCCGC
CGCTCCCTCC ACCAGCACCC ACCCGACCCG CCACCGATGG TCACACCGAC CCCGGAGTCC
GCCGTCCGTC GCACACGCAT CCTCGGCGAC ATGATCAACG AGTACCGCAA CGCCGCCTGG
CGACGCCCCC AAACGATCAC GTCAGCTGCA AAAGAGCAGA TCAGAGGCCG AAACCCAAGT
TCTGGAGCCC CACACCCTCG TTGCTGTTCT CGGGCACGAT TGCTGTAG
 
Protein sequence
MSGTNGVWSL LYALTRNALG LMLLRVRGDT AKDVELLVLR HQVAVLRRQV NRPTLEPADR 
LILAALSRLL PRARWGSFFV TPATVPRWHR ELLARQWTYP RKSPGRPPVR REIRELILRL
ARENPTWGHR RIHGELVGLG YTVGVATVWR ILHRAGVDPA PRRADTSWRT FLSAQASGLL
ACDFFTVDTV FLQRIHVLFV VEHTTRHVHV LGATKHPTTA WVTQQARNLL MDLDERGHRF
RLLIRDRDTK FTASFDAAFA GAGIDVMRTP PQSPKANTIA ERWVGTVRRE CTDRLLIVSE
QHLTSVLSSY AKHFNTHRPR RSLHQHPPDP PPMVTPTPES AVRRTRILGD MINEYRNAAW
RRPQTITSAA KEQIRGRNPS SGAPHPRCCS RARLL
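
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below is one way to check the two sequences against each other and to recompute GC content. The function names are hypothetical, the start-codon set is an assumption based on the initiators listed for table 11, and the 10-base grouping with spaces is taken from the layout of the sequence blocks above.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 uses the same assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: codons accepted as initiators under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases, and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS; a recognized start codon is read as M, a stop codon ends it."""
    s = clean(seq)
    residues = []
    for index in range(0, len(s) - len(s) % 3, 3):
        codon = s[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
first_30 = "GTGTCGGGCA CGAATGGGGT GTGGTCGCTG"
print(translate_cds(first_30))  # MSGTNGVWSL
```

Passing the full gene sequence to `translate_cds` and `gc_percent` would allow a comparison with the protein sequence and the GC content field of the record.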