Gene Franean1_1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1800 
Symbol 
ID5670202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2162137 
End bp2163366 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content77% 
IMG OID641240721 
Productaminotransferase class V 
Protein accessionYP_001506144 
Protein GI158313636 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.345473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCAC GCGGCCCCGC AACGTCCGGC CAGCTGCTCC GGGACGGGCG TAGCGTCACT 
CTCGTGCCGG CCTATCTCGA CCACGCTTCG ACCACGCCGC TGCATCCGGT CGCCCGGGAG
GCCCTGCTCG CCGCGCTCGA CGACGGCTGG GCCGACCCCG CACGGCTCTA CCGCGAGGGC
CGCCGCGCGC GGATGCTCCT CGACGCCGCC CGGGAGACCG TCGCCGGCGT CCTCGGCGCC
CGGACCAGCG AGATCAGCTT CACCGCGAGC GGGACGGCCG CGGCGCACCA GGCGCTGCTC
GGCACCGCCG CCGCCCGCCG CCGCGCCGGG CGGGTGGTTG TGGTCAGTGC CGTGGAACAC
TCCAGCGTCC TGCACGCCGC GCAGCGCCAC GAACGTGCCG GCGGCGAGGT CGTCACGATC
GGTGTGGACG GCCTCGGCCG CGCCGACCCG GCCGCGTTCG AGGCCGCGCT CGACGCCCAC
CCGGGAACGG CCGTGGCCGC CCTGCAGCAC GCCAACCACG AGGTGGGCAC CGTCCAGCCG
GTCGCGGCGG TCGCGCGGGC GCTGCGCCGG CGCGGGGTGC CGCTGCTCAC CGACGCGGCG
ACGACGGTCG GGCGGGTTCC CGTCGACCTC GCCGAGCTGG GCGCGGACCT GCTCACCGCG
AGCGCACACA AGTTCGGCGG GCCGCCCGGG GTGGGCATCC TCGCCGTCCG CACGGGCACC
AGGTGGGCCA ACCCGCTGCC GGCGGACGAG CGGGAGCACG GGCGGGTTCC CGGCTTCCCG
AACGTTCCCG CCGTCGTCGC CACGGCTGCC GCGCTCGCCG TGCGCGCCAC CGAGATCGAC
GCGGAGGCGC CCCGGCTCGC CGGCTACACC GAACGCCTCC GCCGGCGCCT GCCGGAGCTC
GTCGAGGACG TCGAGCTGCT CGGCCCCGGC GGCGCCGACC CGGCGGTGGG ACTGCCGCAC
ATCGTGGCCT TCTCATGCCT TTACGTCGCG GGCGAGGCAC TCCTGGACGA GCTCGACCGT
GCCGGCATCG CCGTCAGCTC CGGGTCGAGC TGCACCTCGG ACACCCTGAC GCCCAGCCAC
GTCCTGGTGG CGATGGGCGC GCTGACCCAC GGCAACCTCC GCGTGTCGTT CGGGCGGGAC
TCCACCGACG CCGATCTGGA GGCGCTGCTC GACGCGCTGC CGCCCGCCGT GCGCGCCGTC
CGCGAGCGCG CCGGGGCGGC AGGCCTGTGA
 
Protein sequence
MPPRGPATSG QLLRDGRSVT LVPAYLDHAS TTPLHPVARE ALLAALDDGW ADPARLYREG 
RRARMLLDAA RETVAGVLGA RTSEISFTAS GTAAAHQALL GTAAARRRAG RVVVVSAVEH
SSVLHAAQRH ERAGGEVVTI GVDGLGRADP AAFEAALDAH PGTAVAALQH ANHEVGTVQP
VAAVARALRR RGVPLLTDAA TTVGRVPVDL AELGADLLTA SAHKFGGPPG VGILAVRTGT
RWANPLPADE REHGRVPGFP NVPAVVATAA ALAVRATEID AEAPRLAGYT ERLRRRLPEL
VEDVELLGPG GADPAVGLPH IVAFSCLYVA GEALLDELDR AGIAVSSGSS CTSDTLTPSH
VLVAMGALTH GNLRVSFGRD STDADLEALL DALPPAVRAV RERAGAAGL