Gene Franean1_4202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4202 
Symbol 
ID5672557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5004146 
End bp5005735 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content74% 
IMG OID641243075 
Productferredoxin-dependent glutamate synthase 
Protein accessionYP_001508492 
Protein GI158315984 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0069] Glutamate synthase domain 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.359651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.965318 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCCGC TCGTCACCGT GCTGGTCCTG GCCGTGCTGA CGGCAGTGGC GGTCGGTGCG 
GCGCTGCTGG CCGCGGCCGG CTGGTGGGCC GCCGCCGCGG TGGCCGGCGC GCTGCTGGCC
GTGGCCGTGC ACGACGTCGT CCAGCGCCAG CACGCCATCC TGCGCTACCA CCCGGTGCTC
GGGCATGCCC GGTTCCTGCT CGAGACCATC CGTCCGGAGA TCCAGCAGTA CTTCGTCGAG
CGCAACTACG ACGGCCGCCC GTTCGACCGT GCCACCCGGA CGATGATCTA CCAGCGGGCC
AAGGGCACCC ATGGGGACCA GGCCTTCGGT ACCGAGCGCG ACGTCAACGA GGTCGGCTAC
GAGTACCTCC CGCACTCCAC GGCGCCCGCG CCGGTCGCCC CCGGCTCCGC GCCGCCGCGG
GTGCGCATCG GCGGGCCGGA CTGCACCCAG CCCTACGACA TGGCGCTGCT CAACGTCTCG
GCGATGAGCT TCGGGTCGCT GTCGGCCAAC GCGGTGCTGG CGATGAACCG TGGCGCCGCG
GCCGGCGGCT TCGCGCACGA CACCGGCGAG GGCGGGCTCA CCGAGTACCA CCTGCGCCAC
GGCGCCGACC TGGTCTGGGA GATCGGCAGC GGCTACTTCG GCACCCGCAC CCCCGACGGT
GACTTCGACC CGGCCCGCTT CAAGGACGTC GCCGCGCTGC CCACGGTGCG GATGGTCGAG
CTCAAGCTGA GCCAGGGCGC CAAGCCCGGC CTCGGCGGCG TGCTGCCGGC GGCGAAGGTC
ACCGCGTCGA TCGCCCGGGC CCGCGGGGTC CCCGAGGGGG TCGCGTGCAT CAGCCCGTCG
TTCCACCGGG TCTTCGCCAC CCCGCGCGAG CTCGTCCTGT TCGTCGGGCG GATGCGCGAG
CTCGCCGGCG GCAAACCGGC CGGGTTCAAA CTGTGCGTCG GGTCGCGCCG CGAGCTGCTC
GCGATCTGCC GGGCGATGGT GGAGGAGGGG ATCACCCCGG ACTTCATCGT GGTGGACGGC
TCGGAGGGCG GCACCGGCGC GGCGCCGCTG GAGTACGAGG ACCACGTGGG CACCCCGCTG
ACGGAGGGCC TGATCACCGT GCACAACGCG CTGGTCGGGG TGGGCCTGCG CGACCGGGTC
CGGATCGGGG CCGCCGGCAA GGTCGCCAGC GGGGTGGACG TCGTCAAGCG GCTCGCGCAG
GGCGCGGACT ACACGAACGC GGCCCGGGCG ATGATGATGG CGGTCGGCTG CATCCAGGCC
CAGCGCTGCC ACACCAACAC CTGCCCGGTG GGGGTCGCCA CCCAGGACCC GCGCCGCGCC
CGCGCCCTCG ACGTCACCGA CAAGAGCGAA CGGGTCCGCC GCTACCAGGC GGCGACCGTC
GCCCAGGCGG TGCAGGTGAT GGCCTCGCTT GGCTGCACCG GGCCCGAGCA GCTCCACCCG
GGGATGCTCA TGCGCCGCGT CACCCACACC GACACCCGCA GCTACGCCGA GCTGTACGAG
TGGCTCGAAC CCGGTGAGCT GCTCGCCGAG GCGCCGCGTT CGTGGGCGGC CGACTGGGCC
GCCGCCGACC CCGACAGCTT CCGTCCCTGA
 
Protein sequence
MLPLVTVLVL AVLTAVAVGA ALLAAAGWWA AAAVAGALLA VAVHDVVQRQ HAILRYHPVL 
GHARFLLETI RPEIQQYFVE RNYDGRPFDR ATRTMIYQRA KGTHGDQAFG TERDVNEVGY
EYLPHSTAPA PVAPGSAPPR VRIGGPDCTQ PYDMALLNVS AMSFGSLSAN AVLAMNRGAA
AGGFAHDTGE GGLTEYHLRH GADLVWEIGS GYFGTRTPDG DFDPARFKDV AALPTVRMVE
LKLSQGAKPG LGGVLPAAKV TASIARARGV PEGVACISPS FHRVFATPRE LVLFVGRMRE
LAGGKPAGFK LCVGSRRELL AICRAMVEEG ITPDFIVVDG SEGGTGAAPL EYEDHVGTPL
TEGLITVHNA LVGVGLRDRV RIGAAGKVAS GVDVVKRLAQ GADYTNAARA MMMAVGCIQA
QRCHTNTCPV GVATQDPRRA RALDVTDKSE RVRRYQAATV AQAVQVMASL GCTGPEQLHP
GMLMRRVTHT DTRSYAELYE WLEPGELLAE APRSWAADWA AADPDSFRP