Gene Franean1_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1045 
Symbol 
ID5669459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1225088 
End bp1226158 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content76% 
IMG OID641239974 
ProductLAO/AO transport system ATPase 
Protein accessionYP_001505407 
Protein GI158312899 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family 
TIGRFAM ID[TIGR00750] LAO/AO transport system ATPase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.172221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCGG GCTCCGCCGG GGTCGCTCCA GCCGGGCCGG CGAGATCCGG GCCGGCGATA 
GCCGGGCGGG CGCGCGCCGC GCGCCCGAGC CCGGCGGAGC TCACTACCGC CGCCCTGGCC
GGTGACCGGC GCGCGGTGGC CCGCCTGATC TCGCTCGTCG AGGACGAATC TGACGCCCTG
CGCGAGGTGA GCGCCCTGCT TGCCCCGCAC ACCGGCCGCG CCCGGGTGAT CGGTCTGACC
GGGGCTCCCG GGGTGGGGAA GTCGACGTCC ACCTCGGCGC TGGTCGGCGC CTTCCGGGCT
CGCGGGCTGC GGGTGGGCGT GCTCGCGATC GACCCGAGCT CCCCGTTCAC CGGCGGCGCG
CTGCTGGGCG ACCGGGTCCG GATGGTCGAG CACGCCACCG ACCCGGATGT GTTCGTCCGC
TCCCTGGCCA CCAGGGGCAA CCTCGGCGGG CTGTCCTGGG CCACCCCACA GGCGCTGAGG
GTCCTCGACG CGGCCGGCTT CGACATCGTG CTGATCGAGA CCGTCGGTGT CGGCCAGGCC
GAGGTGGATG TCGCCTCGCT GGCCGACACC ACGCTGGTCC TGCTCGCCCC GGGCATGGGG
GACGGGATCC AGGCGGCCAA GGCCGGCATC ATGGAGATCG CCGACATCCT CGTCGTCAAC
AAGGCCGACC GTCCCGGCGC CGACCACACC TACCGCGACC TTGTCGCCGC CGTCCGGATG
GCCGGTGGCA CGGCGGCCGG TGGGGCGGCG GAAGCCGGCT GGCGGCCCGA GGTCGTGCGG
CTCGAGGCCG CGACCGGGAA GGGCGTGCCG GAGCTCCTGG ACGCGATCGA GCGCCACCGC
GACTGGCTGC GGACGTCCGG TGAGCTCGAA CGCCGCCGGC TGCACCGCGC GGCCGAGGAG
ATCTCCCAGA TCGCCCTGGC CGGCATGCGG GCCCGGCTGG GCAGGCTCAA CGGCGCGGCC
CAGCTGGCCG ACCTGGCCCG CCAGGTCACC TCCGGCCGCC TCGACCCCTA CACCGCGGCC
GCCACCCTCC TGGCCGCCAT CCCCGACCCC CACCTCCCGC GCTCCGGGTG A
 
Protein sequence
MSAGSAGVAP AGPARSGPAI AGRARAARPS PAELTTAALA GDRRAVARLI SLVEDESDAL 
REVSALLAPH TGRARVIGLT GAPGVGKSTS TSALVGAFRA RGLRVGVLAI DPSSPFTGGA
LLGDRVRMVE HATDPDVFVR SLATRGNLGG LSWATPQALR VLDAAGFDIV LIETVGVGQA
EVDVASLADT TLVLLAPGMG DGIQAAKAGI MEIADILVVN KADRPGADHT YRDLVAAVRM
AGGTAAGGAA EAGWRPEVVR LEAATGKGVP ELLDAIERHR DWLRTSGELE RRRLHRAAEE
ISQIALAGMR ARLGRLNGAA QLADLARQVT SGRLDPYTAA ATLLAAIPDP HLPRSG