Gene Franean1_4201 details

Gene Information

Locus tag: Franean1_4201
Symbol:
ID: 5672556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5002147
End bp: 5004084
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 77%
IMG OID: 641243074
Product: alpha/beta hydrolase domain-containing protein
Protein accession: YP_001508491
Protein GI: 158315983
COG category: [I] Lipid transport and metabolism
COG ID: [COG0657] Esterase/lipase
TIGRFAM ID:
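
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 5002147
END_BP = 5004084


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1938 645
```

Under these assumptions the result agrees with the Gene Length (1938 bp) and Protein Length (645 aa) fields.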


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.538787
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.933486
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGGG TTTTCGCCGC GCCGGCTTGC CCGGCCGCCG CGGCCGGGTC ACACTGCGAC 
CCAGTGGCCA CGCCCACACC CCGCGCCGCC CGCTCCCACC CGGCGACGCG CGTGGTGATT
GGGGAGGGGG AGCATGTTCG GTCGCCGCAG GACCACCGTG GACGAGCATC CATCCGACCA
CGCGGCGCGC GGGCCGGCAC CTGCCGCCTT GGACGGCCCC GGCCCGTACG ACGTCTCCGA
GCTCGCCTGG CCCGGCGAGG CAGGCGAGGC GGCGCACGAG GCGCCGACCA ACCCGCCCGG
CGACGAGGCC GGTTCCGGCG ACGCCGTCCC GGGTGTCGGA GCGGTACCCG GTGGGGCGGG
GCGGCGCGCG CCCGAGATTC TCGACCTCGG CGCGCTGCGC GTCGCGGTGC CGCCGGGAGT
GACCGTGCGG ATCGCCCGCG GCACGCAGAC CGGGCCGGGC ACCGACCTCG TCCTGACGGC
CGCCGGGCTC ACCGTGCGGG TGACCGTGTT CGCCGCGCCG ACCAGCGGGC AGCTGTGGGC
CAGAGTACGC GCCGAGCTGG CCGCCGGCCA CCCGGAGTCG GAGGTCCGCG ACGGCCCCCA
CGGCCCGGAG CTGCTGCTCC CGGGCTGCCG GGTCGTCGGA GTCGACGGCC CGCGCTGGTT
CCTGCGGGCG GTCGTGACCC CGGCCGACGC CGACGGGGCC GACGAGATCC TGCGCGGCCT
CGTCGTCGTA CGCGGCCCCG GTGCCATGCC GGCGGGCACG GCCCTGCCGC TGCTGCCGGT
CGGGCCGGCC GGCAGCCGCC CGAACGCCGG AGCCCTCGAC GACCTGTGGA CCGACGAGCC
GCTGACGGCC GGCACCGGAA CCCTCGAGGG TCGCCGCGCC GTGCACGAGT ACGCGGCGGC
CTTCACCGGC GACCTCAGCC GCAACCTGTC GACCTGGGGC TGAAGCCCCG CCGGGCTGTG
GGGCATGATG CAGCCGGGGG TGGGTCGTGC CATGCCCGTC TCCCCCGTCT CCCGACCCGA
CTCGACCTTG ATGGCGGCGG TGTGCGCACC GATCAGGACC TGGCCTACGC GACGCTGTCC
GACGCGCAGC GTCTCGATCT GCTCCTGCCC ACGGGCGGCG GCGATCCGCC TCCGGTCGTC
GTCGCGATCC ACGGCGGCGG GTTCGCCGTC GGCGACAAGC AGGACATGGC CCGCACCGCG
CACGCGCTCG CCGGCGCGGG CTACGCGGTG GCCAGCGTCA ACTACCGGCT CTCCGGTGAG
GCGGCCTTCC CGGCGGCGGT CGCCGACGTC CGGGCCGCGG TGCGCTGGCT GCGGGCGAAC
GCGCGCCGCC TCGGGCTGGA TCCGGCCCGG ATCGGGGTGA TCGGCGAGTC GGCCGGCGGC
TATCTCGCCG CCATGCTCGG CGCCGCCGGC GACGACCCGC TGCCGGGGGA CGTCGACCTG
GGCCCTGCCG TCGGCCTGGA CCCTGGCGTG GACCTCGGCC CGGCCGGAGC GCGGCCGTCC
AGCGCGGTGC GGGCGGTGGT CGACCTGTAC GGCCCGGTGG ACTTCTCGAC CATGGACGCC
CAGCTGCGCG CGAATCCGCG CTGCCCGGCC CGGGCCGCCT CGCACGACCG CGCGGACTCC
CCCGAGTCGC GTTTCCTCGG CGCGCAGATC ACCGCCGCGT CGGAGCTGGT GCGCCTGGCC
AGCCCGCTGT CCCACCTGCG CCGTGACCGC CCGCCGCCGC CGTTCCTGAT CGAGCACGGC
GACACCGACT GCACCGTCCC CTACCAGCAG TCGCAGCAGC TCGCGGACGG CCTGTGCGCC
GCCGGCGGGT CGGTCGAGCT CACCCTGCTG CGGGGGGTGG GCCACGGCGG GGCCTTCCCG
CTCGCCGAGC GCCTGCCAGG CATCATCCAG TTCCTGGACC GCGCTCTGGA CCGCGCTCTG
GACCACGCCC CGCGCTGA
 
Protein sequence
MTRVFAAPAC PAAAAGSHCD PVATPTPRAA RSHPATRVVI GEGEHVRSPQ DHRGRASIRP 
RGARAGTCRL GRPRPVRRLR ARLARRGRRG GARGADQPAR RRGRFRRRRP GCRSGTRWGG
AARARDSRPR RAARRGAAGS DRADRPRHAD RAGHRPRPDG RRAHRAGDRV RRADQRAAVG
QSTRRAGRRP PGVGGPRRPP RPGAAAPGLP GRRSRRPALV PAGGRDPGRR RRGRRDPARP
RRRTRPRCHA GGHGPAAAAG RAGRQPPERR SPRRPVDRRA ADGRHRNPRG SPRRARVRGG
LHRRPQPQPV DLGLKPRRAV GHDAAGGGSC HARLPRLPTR LDLDGGGVRT DQDLAYATLS
DAQRLDLLLP TGGGDPPPVV VAIHGGGFAV GDKQDMARTA HALAGAGYAV ASVNYRLSGE
AAFPAAVADV RAAVRWLRAN ARRLGLDPAR IGVIGESAGG YLAAMLGAAG DDPLPGDVDL
GPAVGLDPGV DLGPAGARPS SAVRAVVDLY GPVDFSTMDA QLRANPRCPA RAASHDRADS
PESRFLGAQI TAASELVRLA SPLSHLRRDR PPPPFLIEHG DTDCTVPYQQ SQQLADGLCA
AGGSVELTLL RGVGHGGAFP LAERLPGIIQ FLDRALDRAL DHAPR
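
The gene sequence starts with GTG while the protein sequence starts with M, which is consistent with translation table 11, where GTG can act as an initiation codon and is read as methionine at the first position. The sketch below illustrates this on the first 30 bases of the gene sequence. It assumes the standard codon-to-amino-acid assignments (which table 11 shares with the standard code) and the commonly listed table 11 start codons; `translate_cds` is a hypothetical helper written for this example, not a library function.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spacing between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":           # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


first_30_bases = "GTGACCCGGG TTTTCGCCGC GCCGGCTTGC"
print(translate_cds(first_30_bases))  # MTRVFAAPAC
```

The output matches the first ten residues of the protein sequence. Because the helper treats the first codon of its input as a potential initiator, it is meant for sequences that begin at the start of the CDS.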