Gene Franean1_3124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3124 
Symbol 
ID5671502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3679244 
End bp3680332 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content62% 
IMG OID641242021 
ProductNMT1/THI5-like domain-containing protein 
Protein accessionYP_001507441 
Protein GI158314933 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.45434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCACC GCAAGAAGAT CGCGTTAGCG GCGTTAGTCG CCGTGTCCGT CATGATGCTC 
GCTGCCTGCG GGTCGGGCAC TGGATCCGAG GCGGACGGCG AGTCGAAATC GATAACTTTC
ATGCTGCCCA CCCCGTCGTG GGACGTGTCC CTAGCGGCTT TCGCCGTGGC GCAGGCGAAG
GGGTACTTCG CAGAGGAGAA CCTCAACGTG AAGTACGTCC TGACCAAAAG CAGCCAGCTC
GCGGCCACGA CCGTCGCTCA GGACACCGAC TCCGTCGGGC TGGTCAGCCC AGAGCCCGTT
ATGATCGCAG CTCAAACAGG TAAAGGTCTG GGGCTGGAGT ACTTCTATAA CTTCTTCCGT
CGGCCGATCT ACAATCTGGC GGTCCTAGCA GACAGCGAGA TAAAGGACCT CCACGGCCTC
CAGGGCAGGA AGGTCGGCGT TCAGAGCCTA TCCGCGGTTG GCGTCTACTA CGTCAAGGCC
TACCTAGCCG AGGCCGGGCT CGCCCCTGAC ACCGTGACGC TCATCCCGAC CGGTAGTGGG
ACGCAGGCAC TCACGGCACT GGAGGGCAAG CAGGTCGATG CCATGCTGGT CAACGACGTC
TGGCCGGCGC AGTGGAGGAA CGCCGGGGTC GAGACCCGCA GCATCTCGAC GACGAGCCAA
CTGTCGGTCG TCCAGCACGG GCTGCTGACG AGAGCGGAGA ATCTGAGGAA TAACCCTGAC
ACGTATGCGG CGCTCGGGCG CGCCGTCGCC AAGGCTACGT TGTTCACCCT CGAAAACCCC
GAGGCGGCCA TCACCATGAT GTGGAAGGCA CAGCCCGAGA CGAAGCCGAC CGGAATCGAT
GACACGGAGG CCATGAGGCA GAGTCTCATG ATTCTCAACG CCCGGATCCC GAACCTGGAG
CTTGGTGCCG GCGAGACGAT GTGGGGCCAG TACCCGGACC GTGCCTTCGC CGACTCCGTC
AAGTTCGCGA CCGACAGCGG CCTGATCACC AAGGATATCG ACCCGAACGT CTTGTCCACA
AACGACCTAG TGGTGAAGAT CAACGACTTC GATGCGGCTG CGGTCACGGC TGACGCGGAC
AGCAGTTGA
 
Protein sequence
MRHRKKIALA ALVAVSVMML AACGSGTGSE ADGESKSITF MLPTPSWDVS LAAFAVAQAK 
GYFAEENLNV KYVLTKSSQL AATTVAQDTD SVGLVSPEPV MIAAQTGKGL GLEYFYNFFR
RPIYNLAVLA DSEIKDLHGL QGRKVGVQSL SAVGVYYVKA YLAEAGLAPD TVTLIPTGSG
TQALTALEGK QVDAMLVNDV WPAQWRNAGV ETRSISTTSQ LSVVQHGLLT RAENLRNNPD
TYAALGRAVA KATLFTLENP EAAITMMWKA QPETKPTGID DTEAMRQSLM ILNARIPNLE
LGAGETMWGQ YPDRAFADSV KFATDSGLIT KDIDPNVLST NDLVVKINDF DAAAVTADAD
SS