Gene Franean1_1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1659 
Symbol 
ID5670061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1983500 
End bp1984666 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content71% 
IMG OID641240577 
Productglutamine--scyllo-inositol transaminase 
Protein accessionYP_001506003 
Protein GI158313495 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0399] Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.66477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.520835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA GTCGACGCCC GGCGCCCGAG TTTCCCGCCT GGCCGCAGTT CGACGGCGCG 
GAACGTGACG GACTGATCCG CGCGCTGGAA CAGGGCCAGT GGTGGCGCAT GGGCGGTGGC
GAGGTCGACG CGTTCGAGCG GGAGTTCGCC GAGTACCACG GCTCGCGCTA CGCCCTCGCG
GTGACGAACG GCACTCACGC GCTCGAGCTG GCGCTGCAGG TGCTCGGGGT CGGCCCCGGC
ACGGAGGTCA TCGTCCCCGG GTTCACGTTC ATCTCCTCGT CGCAGGCGGC CCAGCGGCTC
GGTGCAGTCG CCGTCCCGGT CGACGTCGAC CTCGACACCT ACTGCATCGA CCCGGCGGCG
GTGGAGACCG CGATCACACC GCGGACGAAG GCGATCATGC CGGTCCACAT GGCCGGTCAC
CTGTCGGACA TGGACGCGCT CGCCAAGATC TCCGCGAACA CCGGTGTCCC GATCATCCAG
GACGCCGCGC ACGCCCACGG CACGCAGTGG CAGGGCCGCA AGGTCGGCGA GCACGGCAGC
GTCGCGGCCT TCAGCTTCCA GAACGGCAAG CTGATGACGG CCGGCGAGGG CGGTGCGGTG
ACCTTCCCGG ATGCCGAGCT GTACGAGCAG GCGTTCCTGC GGCACAGCTG TGGCCGTCCG
CGGACGGATC GCCGCTACTT CCACCAGACG TCCGGCTCGA ACTTCCGGAT GAACGAGTTC
TCGGCATCCG TGCTGCGCGC GCAGCTCGCC CGGCTCGGCG GCCAGATCGA CACCCGTGAG
CAGCGCTGGC CGGTGCTGTC CGGCCTGCTC GCCGAAATCC CCGGCGTGGT CCCGCAGAGC
CGCGACCCGC GCTGTGACCG CAACTCGCAC TACATGGCGA TGTTCCGGGT GCCCGGGTTC
GGCGAGGAGC GGCGCAACGC CCTCGTCGAC GCGCTGATCG AGCGCGGCCT GCCGGCGTTC
GCGGCGTTCC GCTCGATCTA CCGCTCGGAC GGCTTCTGGG AGACCGGCGC GCCCGACGAG
ACCGTGGACC AGATCGCCGC GCGCTGCCCG AACGTCGAGG CGCTGAGCGC GGACGGCATC
TGGCTGCACC ACCGCACGCT GCTCGGCACC GAGGAGCAGA TGCACGACAT CGCGGCGATC
ATCGCCGAGA CGCTGGCGGA CGCGTGA
 
Protein sequence
MSDSRRPAPE FPAWPQFDGA ERDGLIRALE QGQWWRMGGG EVDAFEREFA EYHGSRYALA 
VTNGTHALEL ALQVLGVGPG TEVIVPGFTF ISSSQAAQRL GAVAVPVDVD LDTYCIDPAA
VETAITPRTK AIMPVHMAGH LSDMDALAKI SANTGVPIIQ DAAHAHGTQW QGRKVGEHGS
VAAFSFQNGK LMTAGEGGAV TFPDAELYEQ AFLRHSCGRP RTDRRYFHQT SGSNFRMNEF
SASVLRAQLA RLGGQIDTRE QRWPVLSGLL AEIPGVVPQS RDPRCDRNSH YMAMFRVPGF
GEERRNALVD ALIERGLPAF AAFRSIYRSD GFWETGAPDE TVDQIAARCP NVEALSADGI
WLHHRTLLGT EEQMHDIAAI IAETLADA